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aaim Amendments 

Claim L (ciirrently amended) A computcriTed storage and retrieval sysUnn ofbiological iofonnation 
cooqirising: 

a means fbr entering data [[mtry]J; 

a means for displaying Ihc data; ?oc^ro^ yw^a^ pa^T^C^ 

a programmable central processing unit /fer p crfonning automated analysis using semi-lineaf t>aihwav 
ne presentatioi ^ (<^f,^TPp) ^d mcthod^to compar e an d predict pathways and thtsir cotnponcnta and to es tablish 
a pathways database: [[and]] 

a drtMtefBge means for storing data: and 

a relational database containing annotated and ciiratcd protein pathways and annotat e d information on 
the pathways and their components ctPi e d in o relational datohflflc. whorcln tfi e pntfaw ny n nrn nrn i otnt a d and 

orggnia B od in a oumtod oluirtering ommgcment ond th e annotat e d information ic aneoouud through the- 
r e lotionttl datohnoe. 

Claim 2. (currently amended) The system of claim 1, wherein the information pertaining Co the pathways 
and their comp onents and [[is]) stored in a plurali^ of tables fiirth«- comprises compriiii u g proteins* their 
sequences and attributes; protein interactions; protein-protein associations; protein pathways; xnKNA, 
microanay, and protein expression data; genes, their sequences and attributes; [[and]] dcscripUons of cellSp 

* 

tissues, and organs[[,]]; paibology reports [[Jliand patient histories [[,]] and treatments. 

Qaim 3. (cuirently amended) The system of claim 1, wherein the central processing unit isyprogrammed to 
run an algorith m or a method for constrained clustering or an ontimization alEgorithm for dynamic 
proaramming and one or more fimctiona or standard methods selected from Tetrieve» input, edit, annotate, 
search, calculate similarities, align, compare, optimize using linear next-neifghbor oriteria or global 
y^'^Pi?fftti on criteria, and predict homologous or orlhologous ^^tem pathways and their comDonents and 
wherein the me thod for constrained clusterm|g is further selected from average linlca^ single linkaas. 
complrte ImlcRPf; K-means. or selfKirpinizinjg maps. 

Claim^ (currently amended) The system of claim 1, wherein the central processing um't is further 
prugiammcd to perfomi one or more analy^p selected from protein sequence analysis, protein interactions 
analysis, protein-protein association analysis, protein pathway analysis, gene expression analysis* pathway 
annotation analysis, pathway edit analysis, pa^way expression analysis^ tissue expression analysis, 
subtractive hybridization analysis, electronic northern analysis, anc| [(ot]| commonality analysis. 

Claim^5C (currently amended) The system of claim 1, wherein each the data i»entered using tho ctnndaixl for 
.pathway in the data base further comprises at least two nodes connected bv a mode rBprpscmta ^ro, 

2 
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Claim 6. (currently amended) I'he system of claim 5 [[1]], wherein q mooM for displttying the datn i a used to 
ohow two related pathways as a diagram containing nod e ti which repr e cant protems or non - prot e in mnl e cule s ; 
mod e o that r e pres e nt prot e in inteiactionB or protein prot e in accociationB: at least two p athwflYfr fMPft ffftmr" 
usmg scoT€s calculated from sequence idsQl^tSCt motif or structural homologies that interrelate nodes [[;]} and 
eroQk>Ying coefficients of simiiarity calculate^ or selected from pene exoression including clone counting of 
cDNAs. ESTs. promoter regio ns, or repulatorv sequences: electronic northe rn analysis, immunoprecip itation, 
microorrav data, quantitative real-time polymerase chain reaction, transcript imaf yje, smA yt^s t two hybrid 
datft; a Prumotwr winrilwitv mailriA, a prutcin-protcin asaucialiim matrix and a similarities lanim^ prouan 
interactiong matrix (SCIM matri;^^ that ttttorelate modes of the pathways. 

Clatn^. (cuiTcntiy amended) Hie system of claim 1, wherein the central processmg unit i^/^grammed to 
eofflpai^ perform pat rwise comparison of at least two protein pathways or their components selected from 
[[by a]] node-only, [[a]] mode-only, fiod [[or a]] node-and-mode comparison and wjhergin the node-only 
comparison is fujdfeig: selected from protein only, non-protein only, and protein and non-protein nodes. 

Claini^Sl (currently amended) The system of claim 1 [[1]], wherein the central processing unit is- 
profflnammod to run runs die [[an]] optimization algorithm for d^amic programming further conqmsing: 

a) initializing an array, in which a two dimensional army M'^Mfj with J rows of nodes and variant 
length for each row, the length for i-th row is ni is set up and Mji=0« where K^K^nj^ 

b) backfUltng tiw array via backward recursion with the formula 

Af4=rnaxjw{aa,£i^)i A^^^ forl<i:Sn., l^it^J 

where d(.) is the step ftmctton defined as t (v)»{0, if ik-KH 1, if y>0} and w(.,.) is die scoring frmctim 
between &e two nodes» defined as 



0, if i= j, aik= aji , aik= -D, or ^yi- -D 

vKlicrc D > 0» or 

^[Cik, j] - tcjo |cr(l - |sik - Sjlj) + (1 - a) Cik, jlj othcnvi^^e 



;and 



e) using tm l Doback to identify finding the best alignment to predict nodes of putative pathways PPWj, 
l<*5<-max ni with the top n best scores. 
Claims 9-^. (canceled) 

Claim 23. (currently amended) The system of claim 3 [[IJ], wherein die central processing unit is- 
to iun o runs the constrained clustering algorithm farther comprising: 
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a) assigning a distance between a protein in a aucrv pathwa y and each protein every ptt i f of piotoinB in 
the database; 

b) merging at each round the [ [two]] protein in the ouerv pathway and th e protein in the database with 
fl^ closest distance [[pairs]] into a cluster until uscr^ct threahold is met; and 

c) computing mean linkage between two clusters using 



whercf^tt]] n, is the size of c^ and i% the sisoe of c^ and the distance between the two clusters is a weighted 
average of the distance between two proteins i and j and a ftinction of similarity to queiy protein (si=SiqQ ^ 
Sjq(j)) where q(i) is the query protein homologous to database protein i and protetn-protein association 
(ajj) and 



4=a(l-^)+(H|^-^-| ^ 



Cbiin24. (canceled) 

^ • J 

Claim (currently ammded) fhe system of claim 7, wherein pnthw n y n n nl yo io n oo o a node-and-mode 
comparison predicts new homologous or orthol ogpus pathways bv oon i prioiugi 
o) eubmitting o quei^pothwoy and prot e in g e qu e no cs ; 

b) comparing node^ (^g an optimization algorithm for dynamic programming oraoonstratt^^ 

clusterin g^algorithm or standard methods and whoroin a aoqufmoo i dimti ly-^orBM m- p vnln a raiinmnr ig » ^ 
i iimilnrity and o w eighting factor between 0 and 1 io ooaignod to oorrospond i ng nod e& ; - 

o) wi iip uring modes by generattog a SdM matrix wher e in a containing coefficients of similari^ w 
aaoignod to ooTTogpondingmodcQ; 



o) gummmg oil ohiiilttrity coorou» end 

0 dicplaying at least one higj^oooring segm e nt of th e aligned pathways d e rived from nod e and -mede- 

OOaipgriOO D^thorebv poFfonning pathway nno^yay^ 

Claim 26. fCUrfCntly amended^ A process for The fiyntGrnnfolmiri 1^ whnrmnthn nnntml p a rwion^mg ^ pjt 

progRUDmod to perform analvadny or predicting novel ho moIo&oiLs or ortibolo^ous pathways analycis us tHfre- 
submitted query pathway and protoin ooquenoeo comprising: ^ 5v^$k^ dou V*^ I 

a) sub mitting a ouerv pathway and protem sequencey 

b) organimig the pathway and sequences usnig the standard for pathway rcpresontotiQa ; 
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fi>[[b)]] ieomparing each scQue nce protoin soQumiooD of the query pathway with each scouence ett- 
protein uoquciioegt ii> the pathways database using an^al^^orithm for constrainud clustering or an optimization 
algorithm for dvnamtc programming and standard meibods of pnitvin vunqjarison to caluulatc sequence 
klentiiv scores: 

d) Ufi)]] using a SCIM matrix derived from protein interac tion ^ or protein-protein association analyaia 
to compare coefficient of similarity fiir each interaction of the query pathway and each interactton 
inttfTOotiuna for prot e ins in a pathway in the pathways database; 

£) UA)]1 calculating an overatl-sao nji |^ty fic*i^^ (OS.Rcnm) based on sequence identity scores and 
coelGGcientS of similarity for each pattoyay: gng 

e ) removing all pathwayo not meeting a uoor cpooified - throohold for O S- aoor e ; and 

f) retrieving e^gasid pathways meeting a uscr-speoified [[the]] threshold fpy pS-floore> thereby 
aiia\y5»tii^g or predictinp novel homologous or orfeologous pathways Qnttlyoip > 



Claim 27. (currently amended) The system of claim ^113 jj» wherein pathwayc databQc a m fi a orohed for 

a) s ubmitting a query poAway t 

b) pcifonning protein interactions analysis between a [[the]] query pathway and each pathway [[all]] 
pathways in the pothwoyo database produces a wherein coefficient of similarity isproduo ed to intenvlate 
each mode of tte query pathway and a mode of the most closely related protein pathway [[;)] 
■ ■■ o> ■ retrieving at leost one pathway oHgnm e ntj and 

d) showing the olignmont booed on the coaroh for protein intorao ^ens. 

Claim 28. (canceled) 

Claim 29. (currently amended) J^f^ propc ^ of cla^ni^lik wherein ^ft o oyotom of okdm ' 3, wh e r e in o quety- 
patfaway to cearcb o pathways databas e to prediot the predicted oithologDus pothwoyo oomprijig gS' 



a) gubmittmg a quoiy pathway and known prot e in se quene e e t 
— b) comparing known s e quroc e o to a i l protein uoquoneoo ctored in the dotobooo; 

c) retrieving t>athwav further comprises orthologous proteins with the highest s^uence identity to die 

faiewn proteins in the ouerv pa^wav [[;]] 

d) — inharitiiig protein interactions inherited from the query pathway[[; and]] 

o) - ^l i gnin8 tho query path%vay and the orthologoufl protein s , thereby predioting orthologoufl pathways . 
Claims 30*36. (canceled) 
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